Robust Speech Recognition by Model Adaptation and Normalization Using Pre-Observed Noise
نویسندگان
چکیده
منابع مشابه
Noise Level Normalization and Reference Adaptation for Robust Speech Recognition
This paper describes an approach to normalize the noise level of a speech signal at the outputs of the Mel scaled filter–bank used in MFCC–feature extraction. An adaptive normalizing function that distinguishes between speech and silence parts of the signal is used to normalize the noise level, without altering the speech parts of the signal. This technique is combined with an adaptation of the...
متن کاملEfficient Speaker and Noise Normalization for Robust Speech Recognition
In this paper, we describe a computationally efficient approach for combining speaker and noise normalization techniques. In particular, we combine the simple yet effective Histogram Equalization (HEQ) for noise compensation with Vocal-tract length normalization (VTLN) for speaker-normalization. While it is intuitive to remove noise first and then perform VTLN, this is difficult since HEQ perfo...
متن کاملImproved feature vector normalization for noise robust connected speech recognition
Feature vector normalization has been successfully used to improve the noise robustness of speech recognizers. Unfortunately, it may cause additional insertion errors in connected digit recognition in clean environments. We propose two methods to reduce the number of insertions. Based on estimated instantaneous signal-to-noise ratio we form a reliability measure for the recognized digits. We di...
متن کاملConstrained Spectrum Normalization for Robust Speech Recognition in Noise
This paper presents a new approach to robust speech recognition in noise based on spectral subtraction. A conventional spectral subtraction technique leads to nonlinear distortions of the normalized speech signals and resulting degradation of speech recognition accuracy. A new method is proposed to constrain spectral subtraction by imposing upper bounds on the estimates of the noise spectra. Tw...
متن کاملRapid response and robust speech recognition by preliminary model adaptation for additive and convolutional noise
Users require speech recognition systems that offer rapid response and robustness (high accuracy). Speech recognition accuracy suffers from additive noise, imposed by ambient noise, and convolutional noise, created by space transfer characteristics. Existing model adaptation techniques achieve robustness by using HMM-composition and CMN (cepstral mean normalization). Since they need the additiv...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEICE Transactions on Information and Systems
سال: 2008
ISSN: 0916-8532,1745-1361
DOI: 10.1093/ietisy/e91-d.3.422